CDS
Accession Number | TCMCG064C00194 |
gbkey | CDS |
Protein Id | XP_011082677.1 |
Location | join(1172849..1173364,1173629..1173841,1174227..1175104,1176293..1176590,1177177..1178121,1178840..1179111,1179374..1179875) |
Gene | LOC105165366 |
GeneID | 105165366 |
Organism | Sesamum indicum |
Protein
Length | 1207 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA268358 |
db_source | XM_011084375.2 |
Definition | DNA-directed RNA polymerases IV and V subunit 2 [Sesamum indicum] |
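The Location field and the protein Length field can be cross-checked against each other. The following is a minimal sketch, assuming the location uses only plain `start..end` segments with 1-based inclusive coordinates (no `complement(...)`, `<` or `>` markers); the helper name `cds_length` is hypothetical and not part of any database tooling.

```python
import re

def cds_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a join(...) location string."""
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    # Coordinates are inclusive, so each segment spans end - start + 1 bases.
    return sum(int(end) - int(start) + 1 for start, end in segments)

location = (
    "join(1172849..1173364,1173629..1173841,1174227..1175104,"
    "1176293..1176590,1177177..1178121,1178840..1179111,1179374..1179875)"
)

n_bases = cds_length(location)
n_codons, remainder = divmod(n_bases, 3)
print(n_bases, n_codons, remainder)  # prints: 3624 1208 0
```

The seven exons sum to 3624 bases, which is 1208 codons: 1207 amino acids plus one stop codon, in agreement with the recorded protein length.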
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGGTGCTCGGGATTTTCTTGAAGAAGCTGGACCTAGTAGCTTTAATGAGAAATTTTCTAATGGTTCCTTCAGAATGGATATTGACGATGACATGGACTTTGATTGTTCTGATTCGGACGATGAGCTTGACAGCTCCCTCCAAGATTTGGACGAGGGTTTTCTAAAGAGTTTCTGCAAGAAAGCCTCAACTGCGTTTTTTGACCAATATGGATTAATTAGTCATCAAATTAATTCATACAATGATTTCGTTAGAAACGGAATTCAAAAAGTGTTCGATTCTATTGGGGAGATCCTAATCGAGCCTGGATATGATCCATCAAAGAAGGGGGATGGTGATTGGAGGCGCGCATCTCTGAAGTTTGGGAAGGTCACCCTTGAGAAACCAAAATTTTGGACTGGTGAAAAGTTTTCATCTGTTGACGGTGCTAAGGAGTATTTGAATCTTTTACCCCGTCATGCTCGTCTTCAGAACATGACTTACTCATCTAGGATCAAAGTTGAGACTCATCTTGAGGTGTTCACCGAAAGTCTCTCCAGAAGTGACAAGTTCAAAACTGGTGTGGAAAAATTTATTGAGAAGACATTACTGCATGAGTATCATACTGATGTTAACTTTGGGAGACTCCCAGTTATGGTGAAGTCGGATTTATGTTGGATGAGTGAACTTGAAGAAAGGGATTGTGAATTCGAACAAGGAGGATATTTCGTGATTAAGGGGGCTGAGAAGACATTCATTGCACAGGAGCAAATATGCCTTAAGAGACTGTGGGTGGCCAAAAGTCCTGCTTGGACCATTTCATATCGGCCAGTTTCTAAACGAAAAAGAATTTTCGTAAAGCTAGTTCCAAAAGTAGAGCACATAACAGGAGGAGAAAAGATTCTCACGGTCTATTTTTACGTTACAGAAATTCCTATTTGGCTTTTGTTCTTTGCTTTGGGTGTTCCAAATGACAGAGAGGTGGTAAAGATGATTGATCTTGATGTAGAAGATTCAGCCATCTCTAACATACTTATTGCATCAATTTATGATGCTGACAGGAAGTATGAGGGTTTCCGTAAGGGAGGAAATGCACAAAAATTTATCAAGGAACTCATGCAGGGTTGTAGATTCCCACCTACAGAGTCAGTGGAAGAATGTATCACAAACTACCTCTTTCCCCATCTCAAAAGCCCAGAGCAGAAGGCTTGTTTTCTTGCTTATATGGTCAAATGTCTCTTGGAAGCTTATAGAGGGCGCCGCAAAGTTGATAACAGAGATGATCTTAGGAACAAGAGGTTGGAGTTGGCCGGTGAGCTACTTGAGCGCGAGCTAAAAGTTCATATCAAACATGCAGAGAGGCGAATGGTTAAAGCTATACAGAGAGACCTTTATAATGACCGAGAGGTGCAGTCTATTGACCATTACTTGGATGCCTCAATCATCACAAATGGTCTTTCAAGAGCTTTCTCAACTGGGGCTTGGTCGCACCCTTACAAGAGGATGGAAAGAATTTCTGGTATAGTGGCTACTATTAGGCGAACTAATCCACTGCAGGCAACAGCTGATATGAGGAAAACTCGCCAGCAAGTCTCATACACGGGCAGGGTTGGTGACGCTAGATACCCACACCCATCTCATTGGGGTAAGATTTGCTTCCTCTCGACGCCGGACGGAGAAAATTGTGGACTGGTTAAGAATCTGGCCAGCATGGGTCTTGTTAGTACTGATATCTTGAATCGGGAATCTCTTCTTCAAAAATTTTATGAATGTGGAATGGAAATGTTGGTTGATGATGCTTCAGCTCTGCTCAATGGGAAGCATAAAATTTTTCTTGATGGAGATTGGGTTGGAATCTGTAAAGATTCCTCATCGTTTGTTGCAAGGGTTAGACGCAAACGCCGCAAGACGGAAGTGCCACATCAGATTGAAATTAAAAGAGACAAGCATCATGGAGAAGTTCGTATTTTCGCTGATGCTGGAAGAATACTCCGTCCTCTCTTAATTGTTCAAAAT
TTGAGGAAAATCAAAGATTTGAAAGGAGATTTTTCATTTCAGTCACTTCTGGATAGTAGCATAATAGAGTTAATAGGTCCTGAAGAGGAAGAAGATTGCCAAACTGCATGGGGAGTAAGATACCTTTTCACAGCTGAATTAGAGAATCCACCAATCAAATACACACACTGTGAACTTGATAGTTCTTTTTTATTAGGACTAAGTTGTGGAATCATCCCCTTTGCAAATCATGATCATGCAAGGAGAGTTCTGTATCAGTCTGAGAAGCACTCTCAGCAGGCTATTGGGTTCTCAACCACAAATTCAAGCATTAGAGTAGATACAAACTCTCATCAATTGTACTACCCCCAGCGGCCACTTTTTAGAACCATGCTTTCAGATTGCCTTGGGAGATCAAAATATGATCACCATAAGGGCATGCTGCCACGACCTGAGTTTTTCAATGGCCAGTGTGCTATCGTGGCTGTCAATGTCCATCTCGGCTACAACCAAGAAGACTCCTTGGTAATGAATCGTGCTTCCTTGGAGCGTGGTATGTTTCGCTCTGAACACGTCCGGAGTTACAAAGCTGAGGTTGAAAATTCAGAAGCAGCTGGGAAAAAGGCGAAGACTGATGACTTGGTTAGCTTTGGAAAGATGCAAAGCAAGATTGGACGTGTTGATAGCCTTGACGATGATGGCTTTCCATACATTGGTGCGAATCTCCAAACTGGTGACATAGTCATTGGAAAGCATGCTGCATCCGGGGTCGATCATAGCATCAAGCTCAAGCACACTGAGAAGGGCATGGTTCAGAAGGTTGTCCTTTCCGCTAATGATGAGGGGAAGAACTTTGCTGTTGTATCATTGAGACAGGTTCGTTCTCCATGTCTTGGGGACAAGTTTTCTAGCATGCATGGGCAGAAGGGTGTGCTGGGGTTCCTGGAGTCCCAGGAAAACTTCCCTTTCACTAAACAAGGAATAGTTCCTGACATTGTGATAAACCCTCATGCATTTCCTTCTAGGCAAACGCCCGGTCAGCTCTTGGAGGCTGCGTTGGCCAAAGGGATTGCACTCGGGGGCGGTCTAAAATATGCCACCCCGTTTACCTCCCCATCAGTTGAAGATATAACAGCGCAGCTTCACAGGCTTGGATTTTCGAGATGGGGGGATGAGAGAGTTTACGATGGGCGAACTGGTGAAAAGGTCCAGTCCCTTATCTTTATGGGACCGACATTCTACCAGCGGCTCACACATATGGCCGAAGACAAAGTGAAATTCAGGAATACCGGGCCAGTTCACCCTCTTACTCGTCAGCCTGTGGCCGACAGGAAACGTTTCGGTGGAATCAAGTTTGGCGAGATGGAGCGAGATTGCCTCATAGCTCACGGTGCAGCAGCCAACCTACACGAACGTCTCTTCACCCTTAGTGATTCATCCCAAATGCATATATGCAGGAAATGCAAGAACATGGCCAATGTAATCCAGCGACCGGTGTTTGGTGGTCGGAAGATACGTGGGCCTTATTGCCGTTTCTGTGAGTCTGTGGAAGATGTGGTGAGGGTGAACGTGCCTTACGGGGCTAAGTTACTGTGTCAGGAGCTATTCAGCATGGGGATATCTCTCAAGTTTGATACTGAGCTATGTTGA |
Protein: MGARDFLEEAGPSSFNEKFSNGSFRMDIDDDMDFDCSDSDDELDSSLQDLDEGFLKSFCKKASTAFFDQYGLISHQINSYNDFVRNGIQKVFDSIGEILIEPGYDPSKKGDGDWRRASLKFGKVTLEKPKFWTGEKFSSVDGAKEYLNLLPRHARLQNMTYSSRIKVETHLEVFTESLSRSDKFKTGVEKFIEKTLLHEYHTDVNFGRLPVMVKSDLCWMSELEERDCEFEQGGYFVIKGAEKTFIAQEQICLKRLWVAKSPAWTISYRPVSKRKRIFVKLVPKVEHITGGEKILTVYFYVTEIPIWLLFFALGVPNDREVVKMIDLDVEDSAISNILIASIYDADRKYEGFRKGGNAQKFIKELMQGCRFPPTESVEECITNYLFPHLKSPEQKACFLAYMVKCLLEAYRGRRKVDNRDDLRNKRLELAGELLERELKVHIKHAERRMVKAIQRDLYNDREVQSIDHYLDASIITNGLSRAFSTGAWSHPYKRMERISGIVATIRRTNPLQATADMRKTRQQVSYTGRVGDARYPHPSHWGKICFLSTPDGENCGLVKNLASMGLVSTDILNRESLLQKFYECGMEMLVDDASALLNGKHKIFLDGDWVGICKDSSSFVARVRRKRRKTEVPHQIEIKRDKHHGEVRIFADAGRILRPLLIVQNLRKIKDLKGDFSFQSLLDSSIIELIGPEEEEDCQTAWGVRYLFTAELENPPIKYTHCELDSSFLLGLSCGIIPFANHDHARRVLYQSEKHSQQAIGFSTTNSSIRVDTNSHQLYYPQRPLFRTMLSDCLGRSKYDHHKGMLPRPEFFNGQCAIVAVNVHLGYNQEDSLVMNRASLERGMFRSEHVRSYKAEVENSEAAGKKAKTDDLVSFGKMQSKIGRVDSLDDDGFPYIGANLQTGDIVIGKHAASGVDHSIKLKHTEKGMVQKVVLSANDEGKNFAVVSLRQVRSPCLGDKFSSMHGQKGVLGFLESQENFPFTKQGIVPDIVINPHAFPSRQTPGQLLEAALAKGIALGGGLKYATPFTSPSVEDITAQLHRLGFSRWGDERVYDGRTGEKVQSLIFMGPTFYQRLTHMAEDKVKFRNTGPVHPLTRQPVADRKRFGGIKFGEMERDCLIAHGAAANLHERLFTLSDSSQMHICRKCKNMANVIQRPVFGGRKIRGPYCRFCESVEDVVRVNVPYGAKLLCQELFSMGISLKFDTELC |
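The CDS and Protein sequences in this record can be checked against each other by translating the nucleotide sequence. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record itself does not state; the function name `translate` and the choice of `X` for unrecognized codons are illustrative assumptions.

```python
import itertools

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    # Drop any whitespace, including line breaks inside the sequence.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six and last four codons of the CDS in this record.
print(translate("ATGGGTGCTCGGGATTTT"))  # prints: MGARDF
print(translate("GAGCTATGTTGA"))        # prints: ELC
```

Both fragments match the start (`MGARDF`) and end (`ELC`) of the listed protein. Passing the full CDS string to `translate` and comparing the result with the Protein field would extend the same check to the whole entry.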